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Abstract — We present a method to eliminate redundancy in 
the transition tables of Boolean automata: schema redescription 
with two symbols. One symbol is used to capture redundancy of 
individual input variables, and another to capture permutability 
in sets of input variables: fully characterizing the canalization 
present in Boolean functions. Two-symbol schemata explain 
aspects of the behaviour of automata networks that the char- 
acterization of their emergent patterns does not capture. We use 
our method to compare two well-known cellular automata rules 
for the density classification task |1|: GKL |2|, |3| and GP |4|. 
We show that despite having very different emergent behaviour, 
these rules are very similar. Indeed, GKL is a special case of GP. 
Therefore, we demonstrate that it is more feasible to compare 
cellular automata via schema redescriptions of their rules, than 
by looking at their emergent behaviour, leading us to question 
the tendency in complexity research to pay much more attention 
to emergent patterns than to local (micro-level) interactions. 

I. Introduction 

The intersection of biology and computer science has been 
fertile for some time. Indeed, von Neumann was a member 
of the mid-twentieth century Cybernetics group (5), whose 
main focus was the understanding of natural and artificial 
systems in terms of communication and control. Most early 
computer science developments were inspired by the models 
of cognition that orbited this group (6). Since then, the need 
to understand how biological systems are able to control 
and transmit information throughout the huge number of 
components that comprise them has only increased. The study 
of complex network dynamics has received much attention in 
the last two decades. From pioneering work on networks of 
automata (7)-(9) to recent models of genetic regulation fTO)- 
JT3) , it is clear that to understand and control the biological 
organization, it is useful to study models based on complex 
networks of automata fl4)-||T6l. 

There has been much progress in understanding the structure 
of natural networks — at the level of their topology fT7) , |18| 
or of their more fine-grained motifs (19) — as well as on 
modelling biological systems as networks of automata. Yet, 
we are still to fully grasp how complex networks "compute" 
and how to harness them to perform specific tasks |20|. The 
need for a better understanding of collective computation in 
complex networks has been identified in many areas. For 
instance, the way plants adjust stomatal apertures for efficient 
gas exchanges on leaf surfaces is statistically indistinguishable 



from the dynamics of automata that compute pT) , and the 
high degree of connectivity in biochemical intracellular signal 
transduction networks endows them with the capability of 
emergent nontrivial classification — via collective computation 
p2| . However, not much work in automata models of complex 
networks has focused on how interactions at the local level of 
components are linked to collective behaviour. 

We show that eliminating redundancy in the transition tables 
of local automata reveals the loci of control of collective 
dynamics, better than via the characterization of collective 
behaviour, emergent patterns, and the like. Our mechanism 
to eliminate redundancy is schema redescription (23), (24), 
which we expand here with an additional symbol as explained 
below. Schemata remove redundancy from automata transition 
tables, leaving only input states, or groups of input states that 
control state-transitions, known as their canalizing inputs |25|. 
Moreover, we discuss how schema redescription can reveal 
the links between local and collective behaviour in automata 
networks (here cellular automata). We exemplify our approach 
with the classical problem of collective computation in cellular 
automata: the density classification task (T). 

II. Cellular Automata 

A n-state automaton is an integer variable x € {0, n— 1}, 
whose value (or state) is computed at time t + 1 via a state- 
determined transition function F, which takes as input the state 
of a set of n automata at time t. When i = 0we refer to it as 
being in the quiescent state; any other state is regarded as an 
active state. To study the dynamics of systems of many simple 
elements (known as complex systems), it is common to study 
networks of interconnected automata. One way to do this is 
to arrange automata in a regular lattice of N cells, which we 
refer to as cellular automata, a formalization of a complex 
dynamical system with its origins in the work of Ulam and 
von Neumann (26) . In a homogeneous cellular automaton 
(CA), which we use here, each cell is defined by the same 
K-state automaton and is connected to n neighbour cells in the 
lattice (including itself). Therefore, there are k" possible local 
neighbourhood configurations (LNCs). The transition function 
for every cell, F — {fj}, can thus be defined as a look-up table 
(LUT) where each entry, fj,j G {1, n n }, is an assignment 
of a possible input LNC, or condition part, to an output state 



Xi(t + 1) € {0, k — 1}. In binary CAs (k = 2) it is possible 
to classify individual cell state transitions in three categories: 
(1) preservations, where a cell does not change its state in 
the next time step, i.e. Xi(t) — Xi(t + 1); (2) generations, in 
which the cell goes from the quiescent to the active state; and 
(3) annihilations, where the cell goes from the active to the 
quiescent state. The initial configuration (IC) of states of a 
CA lattice is typically random. In the CAs we consider here, 
cells in the lattice update their states synchronously. 

III. Schema Redescription 

We previously proposed schema redescription with one 
symbol as a method to relate an automaton's LUT to the col- 
lective dynamics it produces in one- and two-dimensional CA 
lattices p3| , J24) . Here we introduce two-symbol schemata. 
The basic idea is simple: LUTs are compressed into par- 
simonious schemata with two extra symbols that capture 
redundant input states. One symbol captures the irrelevance 
of some inputs in some situations; the other the permutability 
of subsets of inputs. Only the input states that effectively 
determine the automaton's state transitions remain. A related 
notion of canalizing inputs exists in the context of modelling 
biochemical networks, e.g. those that regulate the expression 
patterns of genes. Indeed, automata with canalizing inputs have 
been shown to stabilize the dynamics of Boolean models of 
genetic regulation fT3) , |25|, \21\. 



A. Wildcard Schemata 

To unpack the notion of schema, consider a Boolean au- 
tomaton x, whose state-transition function depends on the 
states of six Boolean inputs (possibly other automata). Figure 
[T|\ depicts the subset of LUT entries (f a ) for automaton x, 
that lead to state-transition x(t + 1) = 1. 

We first compress LUT entries using schemata that are like 
LUT entries, but allow an additional wildcard (#) symbol 
(represented graphically in grey) to appear in their condition 
part. A wildcard input means that any valid state is accepted 
for it, with no effect on the state-transition. This results in a 
redescription of the set of LUT entries, F, for an automaton x 
of n inputs into a smaller set of wildcard schemata, F' = {/£} 
(see Figure [TJi). The set F' , for a given automaton also 
contains the original entries in its LUT, F, that could not 
be redescribed by wildcard schema. Each specific wildcard 
schema f' v redescribes a subset of entries in the original LUT, 
denoted by T v = {f a : f a s— > (>— > means "is redescribed 
by"). Wildcard schemata are minimal in the sense that none 
of the (non-wildcard) inputs in the condition part of a schema 
can be "raised" to the wildcard status and still ensure the 
automaton's transition to the same state. Because wildcard 
schemata are minimal, it follows that for any two wildcard- 
schemata f' v and fi, T v T^AT^ ^ T v . Because wildcard 
schemata are minimal and unique, they are equivalent to the 
set of prime implicants obtained during the first step of the 
Quine & McCluskey Boolean minimization algorithm |29|. 
In other words, a schema is unique in the sense that the 
subset of LUT entries it redescribes is not fully redescribed 



by any other schema. However, in general T„ n ^ 0. 
This means that schemata can overlap in terms of the LUT 
entries they describe. In Figure [I] Ti = {fx, f 5 , / 9 , /i 3 } and 
T 9 = {/ 4 , fc, h, / 7 }, therefore T x n T 9 = {/ 5 }. 

Our redescription methodology derives from the work of 
John Holland on condition/action rules to model inductive 
reasoning in cognitive systems (28). However, the same idea 
had been developed previously for the minimization of circuits 
in electrical engineering J29) . It was also used by Valiant 
1 30 1 when introducing Probably Approximately Correct (PAC) 
learning. 
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Fig. 1 . A set of LUT entries with transition to 1 , redescribed. White (black) 
states are (1). Wildcards are grey. Notice that Tg = {/4, /s, f&, /y}. 
Using the additional position-free symbol, the entire set F' is compressed 
into a single two-symbol schema: /" . Any permutation of the inputs with the 
position-free symbol results in a schema in F'. Since there is only one set of 
marked inputs, the position-free symbol does not require an index. 

A wildcard schema can always be expressed as a logical 
conjunction of literals (logical variables or their negation). 
Since such a schema is a prime implicant, it follows that 
all of its literals are essential to determine the automaton's 
state transition. We refer to the literals (input states) in a 
wildcard schema as its essential input states or enputs for 
short. For instance, schema f 9 of the example of figure [I] has 
4 enputs: the first 4 cells in its condition part. The set F' can 
be expressed as a logical disjunctive normal form (DNF) — that 
is, a disjunction of conjunctions. 

B. Two-Symbol Schemata 

Here we introduce a further redescription of F' that results 
in a set of two-symbol schemata F" = {/g } (Figure [jjT). 
The position-free symbol (o m ) above inputs in a condition 
part means that any pair of inputs thus marked can "switch 
places" without affecting the automaton's state-transition. 
The index of the position-free symbol, when necessary, is 
used to differentiate among distinct subsets of inputs whose 



elements can only switch places with each other. A two- 
symbol schema fg redescribes a set Og = {f a : f a >— > fg} 
of LUT entries of x. In the same way a wildcard schema 
captures a subset T v of LUT entries, a two-symbol schema 
redescribes a subset of wildcard schemata [] Therefore f' e ' 
captures the degree of symmetry or group invariance in the 
set of wildcard schemata contained in Q' g |3l"[ . We refer to 
the subsets of such marked inputs in a two-symbol schema as 
group-invariant enputs, which may include wildcard symbols 
marked with the position-free symbol. Since each two-symbol 
schema compresses a number of wildcard schemata, it follows 
that for a given automaton x, \F"\ < \F'\ < \F\ = 2 k . 

After redescription, all redundancy is removed in the form 
of the two symbols. In a given schema, a wildcard input alone 
is not essential by definition: it is never an enput, while an 
individual input set up to a Boolean state always is. However, 
when a wildcard input is marked with a position-free symbol, 
then it is part of a group-invariant enput defined by the set of 
inputs thus marked. In this case, the enput identifies a group 
invariance property, such as "as long as two of these inputs 
are false". To summarize, the set of wildcard schemata F' 
of an automaton x captures its enputs, while the addition of 
the position-free symbol captures group invariant relationships 
in subsets of wildcard schemata. In our working example, 
the resulting two-symbol schema (Figure [T| reveals that the 
automaton's transition to true is determined only by a subset 
of its six inputs: as long as inputs 2 and 3 are false and true, 
respectively, and among the others at least one is true and 
another is false, the automaton will transition to true. Note 
that such minimal input constraints are not obvious by looking 
at the LUT of the example automaton. 

In previous work, we used wildcard schemata to identify 
conceptual properties of known solutions to a well-studied 
problem that requires non-trivial collective computation: the 
density classification task in homogeneous cellular automata. 
By uncovering these properties, we were able to devise a 
genetic algorithm that operated on a search space of wildcard 
schemata, rather than the space of LUTs. We found that 
the space of schemata is much more amenable to search, 
with very good correlations among good solutions to the 
problem. This way, we found some of the best solutions to 
this problem J23) , J24) . Our aims here are: (1) introduce two- 
symbol schemata; (2) demonstrate that they are more useful 
to explain and control the function and behaviour of CAs than 
existing techniques to analyse collective behaviour, and (3) 
it is more feasible to compare automata transition functions 
via their schemata redescriptions, than by looking at their 
emergent or collective behaviour. 

'Given a two-symbol schema, f'J, the wildcard schemata it redescribes 
can be denoted by the set It is useful to look at set 0^ as a matrix 
where rows correspond to wildcard schemata, and columns to their inputs. 
The last column is the state transition of fg, and may therefore be omitted. 
The wildcard-schemata rows of matrix 0^ have the same number of zeroes, 
ones and wildcards. For any pair of inputs in f g ' that are marked with 

the same position-free symbol, the corresponding i th and j th columns of 61 
can be switched, and the rows of the resulting matrix reordered to obtain 0^ 
again. 



IV. The Density Classification Task (DCT) 

The density classification task (DCT) is the most studied 
example of collective computation in CAs. The goal is to find 
a binary CA that can classify the majority state in a random 
IC. If the majority of cells in the IC is in the quiescent (active) 
state, after a number of time steps the lattice should converge 
to a homogeneous state where every cell is in the quiescent 
(active) state. Since the outcome could be undecidable in 
lattices with even number of cells (N), lattices with an odd 
number of cells are used instead. Devising CA rules that 
perform this task is not trivial, because cells in a CA lattice 
update their states based on local neighbourhood information, 
the states of 7 input Boolean variables in this case. Therefore, 
this task requires information to be transferred across time 
and (lattice) space to achieve a correct global classification. 
The performance of CA rule F, on C initial configurations of 
length N is denoted by V%{F). The definition of the DCT 
used in our studies comes from (32|. 

V. Explanation in Complex Systems 

The complex systems community has produced method- 
ologies to characterize space-time dynamics in CAs to study 
emergent behaviour, often using the DCT as a test-case |20|. 
The most well-known such methodology is the computational 
mechanics framework (CM) [33], [34], used to extract regular 
domains: repeating patterns in the space-time dynamics, for- 
mally defined as sets of regular languages. It has been argued 
that in order to explain emergent behaviour, we should look at 
the boundaries between domains, known as particles, as this is 
where information is exchanged in the CA lattice [35) . There- 
fore, collective behaviour in CA dynamics is characterized by 
finite catalogues of particles and their interaction rules, which 
define the emergent computation implemented by a given CA 
| |20| , and can be seen to possess some features of linguistic 
representation, though not in fully symbolic forms |36) . Figure 
[2] depicts the catalogue of particles for the .Fgkl rule J2J, J3], 
which we describe below. 

Other methods exist to detect regularities in CA space- 
time dynamics, typically statistical and information theoretical 
filters that uncover patterns that correspond to the particles 
found via CM [37|, [38]. But when it comes to understanding 
and controlling collective behaviour, a key issue is concerned 
with finding the best explanatory device to use. Do we focus 
exclusively on the emergent patterns of collective dynamics 
(the "emergentist" paradigm), as CM and related methodolo- 
gies do? Or can we gain explanatory advantages by paying 
more attention to the lower level of local transition functions? 

In the remaining of this article we pursue the second 
approach, towards a hierarchical methodology that seeks 
to link local interactions to global behaviour, ultimately 
allowing us to better understand and control the collective 
dynamics of complex systems modelled as automata networks. 
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Fig. 2. Catalogue of regular domains, particles (domain boundaries), partic 
velocities (in parentheses), and particle interactions seen in -Fqkl 's spac 
time behaviour. The colouring assigned to each particle is then used in Figu 
[3] The notation p ~ A x A y means that p is the particle forming the bounda 
between regular domains A x and A v . (Adapted from Q.) 

VI. Comparing Collective Dynamics 

Most of the known one-dimensional CA rules for the DC 
were analysed using wildcard schemata in (39). We four 
that most of these CAs are process symmetric: they classil 
equally well ICs with majority 0s and with majority Is. V\ 
also found that two of the most well-known high-performir 
CA rules for the DCT, while observing very distinct collecth 
behaviour, are nonetheless similar at the local LUT level |39| 
More specifically, we refer to the human-derived CA Fqkl [2 
J3j, and Fq P , derived via genetic programming |4|. Figun 
[3] and [4] depict the space-time dynamics of Fgkl an d Fq 
respectively. The performance of these rules on the DCT i 
V121(F GK l) « 0.815, and F GP Vl2l(F CP ) « 0.822. In tf 
remainder of this article, instead of Fgkl we use the full 
equivalent Fgkls which is the mirror rule of Fqkl- That i 
exactly the same CA rule, but where the particles traverse tl 
lattice in the opposite direction, with the same velocities ar 
interactions (see computationally equivalent CAs in [39 1). 

The collective behaviour of Fgkl' and Fq P is quite distinc 
while -Fqkl' > after a short transient, leads to the appearance ( 
three domains, the dynamics of the Fgp leads to the appearanc 
of many more (we have identified at least ten p6| , see figuj 
|4}. Because Fgkl' only has three domains, from the CI 
perspective, its collective dynamics can be characterized ar 
explained with a catalogue of six particles: the permutatioi 
without repetition of two out of three possible domains, whet 
order is relevant Indeed, the collective behaviour of Fgs 
has been well predicted by the CM framework, using on] 
six particles (40}, (41]. In contrast, -Fgp produces many mot 
particles; even with just ten domains (it has more), there woul„ 
be ninety possible particles and many potential interactions. 
Interestingly, even though the collective dynamics of the 
two rules is apparently (via the CM framework) completely 
different, we found that Fgp can be easily derived from Fgkl, 
via a small sequence of operations on the wildcard schemata 
representation of the latter. 

The wildcard schemata for -Fgkl' are shown in Figure [5] 
Note that most schemata are state-preserving: all state-changes 
in this automaton are described by two wildcard schemata 
{f' b and f[ ). Generations (annihilations) occur only when a 

2 p = d\j (d — 2)!, where d is the the number of domains. 
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Fig. 3. (Top) A space-time diagram produced by Fqkl ■ (Bottom) The 
diagram with the regular domains filtered out, leaving only the locations of 
the particles (coloured using the scheme in Figure [2j. Note that particle a 
(red) lasts for only one time step, after which it decays to particles S and fi. 



cell in state (1) observes right (left) and right-most (left- 
most) neighbours in the opposite state — this is characterized 
by schemata f' b and /{ . In all other situations the a cell Xi(t + 
l)= Xi (t). 

The wildcard schemata for Fgp are shown in Figure [6] In 
this case, all state changes are characterized by three wildcard 
schemata: /{, f$, and /g for annihilation, and /g, /g, and f' w 
for generation. In all other situations the a cell Xi(t + 1) = 
Xi(t). Using the the two-symbol schemata introduced here, 
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Fig. 4. A space-time diagram produced by Fqp 
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Fig. 5. Wildcard schemata of Fqkl' ■ Schemata / g and fl Q describe all the 
possible state-changes for a cell; all other situations are preservations. Notice 
that schemata fL , , or fg , when applicable to a state-change, describe 
LUT entries already described by fL or /|0, respectively; When prescribing 
preservations the same schemata are captured with less enputs by /{ , /3 , /g 
or /g . This means that f% , , or fg are not essential schemata. 



we can appreciate better the similarity between the two rules. 
Figure [7] depicts the essential state-changing transitions in 
both rules, which we obtained by via two-symbol schema- 
redescription, thus removing all redundancy from local dy- 
namics. We can now, for the first time, understand that Fqp 
is doing something very similar to, and which includes the 
behaviour of, Fgkl' at local level. 

Both CA rules can be understood by a single schema for 
generation and their process-symmetric schema for annihi- 
lation. Each such schemata is characterized by two enputs. 
Moreover, one of the enputs is exactly the same for both 
CAs: for the generation schema, both CAs depend on the 
right-most neighbour being 1, and for annihilation schemata, 
both CAs depend on the left-most neighbour being 0. The 
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Fig. 6. Wildcard schemata for -Fop. /(, f^, /g, /g, fg, and f' 1Q describe all 
the possible state-changes for a cell Xi\ all other situations are preservations. 
Notice that schemata and /g, when applicable to a state-change, describe 
LUT entries already described by /g or / 5 and /g or /( , respectively 



second (group-invariant) enput in Fqp is also very similar 
to, and indeed contains the second enput for, Fqkl'- F° r 
the generation schema, Fqkl' depends on the immediate-right 
neighbour being 1, and for the annihilation schema, Fgkl' 
depends on the immediate-left neighbour being 0. When it 
comes to Fqp, the second enput is a group-invariant enput: 
one of three possible input cells in the neighbourhood may 
be specified (set to or 1). Specifically, For the generation 
schema, Fqp depends not only on the right-most neighbour 
being 1, like Fqkl'> but alternatively, on the immediate-left, 
immediate-right or left-most neighbours being 1 . Likewise, for 
the annihilation schema, Fqp depends not only on the left-most 
neighbour being 0, like Fq KL >, alternatively, on the immediate- 
left, immediate-right or right-most neighbours being 0. 
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Fig. 7. Two-symbol schemata for -Fgkl' (Fgkl) an d Fqp, characterizing 
every possible situation where there are state changes for these rules. Note 
that here we include the state of the updating cell, (even if not an enput in 
the original schemata) to describe the known state-changing transitions. 

VII. Connecting the Levels 

The similarity between the local-level descriptions of Fgkl' 
and Fqp we discovered, as well as their similar performance 
leads to a natural question: do the additional regular domains 
produced by the Fqp, play a significant role in its collective 
computation of the DCT? The following three facts are known: 
(1) Fqp produces the same three regular domains as Fqkl', 
plus many others; (2) the performance difference between 
the two rules, while significant, is less than 1%; and (3) the 
number of possible particles grows very rapidly with number 
d of domains (d\/(d— 2)!). Therefore, it is reasonable to con- 
clude that many — even perhaps most of the regular domains — 
produced by Fqp must be redundant; i.e. the particles they 



produce lead to interactions that have no effect on information 
transmission across the lattice. 

Support for this conclusion comes from different sources. 
One is simple inspection of the space-time dynamics pro- 
duced by Fcp. Figure [8] depicts a portion of the space-time 
dynamics for F^p shown in Figure [4] Notice, for example, 
the collision between the particle formed by domains A 5 and 
A 3 (highlighted in green), and the particle formed by domains 
A 5 and A 6 (orange). This collision, in the CM framework 
perspective, leads to two new particles: one formed by domains 
A 6 and A 4 , and another formed by domains A 3 and A 4 . 
However, this collision seems to be redundant, leading to 
particles that behave exactly as the colliding particles (same 
velocity, possibly same structure). This is highlighted in Figure 
[8] by using the same colours for the particles observed before 
and after the collision. This suggests that the (large) set of 
possible regular domains in the dynamics of Fq P contains sets 
of domains that are equivalent, such that collisions among the 
particles they produce are redundant. In the example above A 5 
and A 6 seem to be equivalent, just like A 3 and A 4 . 
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Fig. 8. In Fqp there are particles that produce equal particles after collisions 
(Highlighted in green (vertical) and orange/purple (inclined) 

Our observation is supported further when looking at the 
schemata for Fqp. We can use the knowledge encoded in 
schemata, to better understand the collective behaviour of the 
CA. Consider f[ in Fgp, which enforces that a cell Xi in 
any state whose left- and right-most neighbours are will 
transition to, or stay at t+1 (see Figure|7]i. This leaves a wide 
"gap" of five cells in-between the 0-cells, which can be in any 
state, without affecting the state-transition of Xi to 0. When 
this pattern is repeated in the lattice, i.e. every 6 th cell is 0, 
using the two-symbol schemata obtained instead of the original 
LUT, we can compute the resulting collective dynamics with 
incomplete information. Indeed, one key advantage of schema 
redescription is that it allows the simulation of the dynamics 
of automata networks from ICs where only the state of some 
cells is specified, while the state of the others is unknown. 

Figure [9] (A) depicts the known dynamics given partial 
lattice information, where every sixth cell is (except at the 
lattice borders) and the state of grey cells is unknown. This 
allows us to know how the initial input pattern will propagate 
in the CA dynamics. In this case, our lack of knowledge 
propagates in the lattice, and our partial knowledge of the 
dynamics eventually disappears completely. In|9](B) we show 
that different such patterns can co-exist independently, in the 
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Fig. 9. Dynamically decoupled dynamics of patterns in -Fop. See text for 
description. 



previous unknown segments of the lattice. The difference from 
(A) is that every third cell in the left half of the initial lattice 
is set to 1, and the third and every third cell after that is set to 
0, in the right half of the initial lattice (these conditions match 
schemata j' 7 and j' 2 respectively). 

One very interesting observation from computing dynamics 
with incomplete information, is that many patterns (such as 
those in Figure |9]l are dynamically decoupled in Fcp. This 
means that the space-time patterns produced from schemata 
/{, /2 and jlj in Fcp are updated independently: they function 
as building blocks that can switch places, producing different 
domains (from the CM perspective) that in reality are formed 
of some of the same sub-patterns (defined by the same 
schemata). This is clear in Figure [9] (B) where the patterns 
from schemata f[,f2 and /{, coexist independently of one 
another. Since some of the schemata for Fcp have large "gaps" 
wherein the states of the cells are irrelevant, if particles collide 
inside those gaps, some of the decoupled underlying patterns 
are unaffected, leading to equivalent domains and redundant 
collisions such as what we apparently observe in Figure [8] 

In contrast, rule Jgkl' does not easily lead to dynamic 
decoupled patterns and redundant collisions, since its schemata 
do not leave large enough "gaps" in the lattice for other pat- 
terns to occur. Therefore, F^kl' presents much fewer domains 
than rule Fcp. But since the domains obtained by rule Fq P 
are the result of schemata with a large amount of canalization 
leading to large "gaps" that can be in any state, many of the 
obtained distinct domains are dynamically equivalent. 

This observation is also supported by the schema re- 
description of additional, better-performing rules for the 
DCT: F MM o 80 2 (23) and F w0 J42) , with performances 
) « 0.845 and Vl2g(F w0 ) ~ 0.889. The two- 
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symbol (state-changing) schemata for these rules are depicted 



in Figure 10 As we can see, for the DCT, increasingly better- 
performing rules become "less canalized". This means that 
they are very sensitive to noise in ICs (as we would expect 
for the DCT): bit-flips or incomplete state-specification have a 
high chance of leading to wrong classification (in DCT terms). 
Since noise is not used in their performance computation and 



fitness evaluations, these rules find advantages in becoming 
increasingly specific to the initial conditions they respond dif- 
ferently to. Therefore, they have a higher number of schemata 
with more enputs and therefor less canalization to describe 
their behaviour. The dynamics of these CAs do not produce 
the large numbers of domains as observed in F G p. [^] 
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Fig. 10. 2-symbol schemata of i*MM0802 an d -Fwo characterizing every 
possible situation where there are state changes for this rule. 

VIII. Discussion 

Two-symbol schema redescription allowed us for the first 
time to understand that despite dramatically distinct collective 
dynamics (emergent behaviour), the Fqp rule is a more general 
version of the -Fgkl' rule. The statistically significant, albeit 
small, performance improvement of Fqp over -Fgkl'> is gained 
by converting a single enput, into a group-invariant enput. 
In other words, Fqp introduced ambiguity to the dynamical 
behaviour ]43) of -Fgkl', by allowing the state-transition to 
optionally occur for a larger set of configurations of a cell's 
neighbourhood. This redundancy in the form of ambiguity 
is quite effectively captured by our two-symbol schema re- 
description with the position-free symbol. 

It should be emphasized that until this point, and especially 
when looking only at the emergent behaviour of these rules, 
their deep similarity was unknown. With schema redescription 
of local-level dynamics, not only can we uncover the similar- 
ity, but we can explain it. It also makes sense that Fqp was 



obtained via genetic programming [4|. This rule was probably 
evolved via minor changes to a tree encoding of -Fgkl; it would 
need simply to substitute a tree terminal for a sub-tree with a 
disjunction node for 3 terminal edges, plus a mirror operation 
to go from Fgkl to F GK l'- 

We can also see that both rules are extremely canalized: 
every possible state-change is controlled by two enputs, dis- 
regarding the states of all other neighbouring cells. But the 
i*cp is canalizing in yet another way: it depends, alternatively, 
on the state of one of the input cells in a group-invariant 
enput. These group-invariant (collective) enputs can be for- 
mally described as a disjunction of alternative input variables 
(logical literals) and captured in schemata redescription by 
the additional position-free symbol. This additional form of 
canalization in Boolean functions, and its effect on determin- 
ing collective dynamics in automata networks used to model 
genetic regulation and signalling networks is a topic we will 
explore further, elsewhere. 

Another key idea we will explore in future work is how 
to use the explanatory power of schema redescription at the 
local level of Boolean LUTs, to link local to global behaviour 
in automata networks. From what we presented here, we can 
already see that the Fgp rule must necessarily lead to some 
equivalent domains and redundant particle interactions. Since 
it contains group-invariant enputs, it canalizes distinct and 
alternative neighbourhood input patterns to the same transition. 
Therefore, domains that seem to be distinct at the collective 
space-time behaviour level, can in reality be equivalent and 
not involved in transmitting novel information through the 
lattice, merely maintaining the same information in different- 
looking domains. In a forthcoming paper we will build up from 
schema redescription at the local level, and the computation 
of collective dynamics from incomplete information, to auto- 
matically identify domains (relevant and redundant), particles, 
and particle interactions at the collective dynamics level. 

As a final topic of discussion, the present work moves us to 
consider that an exclusively emergentist approach to complex 
systems dynamics is incomplete. The induction of recurring 
patterns in dynamics is not a substitute for uncovering the 
lower-level mechanisms that actually dictate the dynamics. A 
full account of complex systems requires attention to local and 
global dynamics. In a sense, too much attention to the "spots" 
or "stripes" we observe at the emergent level of behaviour, 
can mask that underneath it all there may be a very similar 
animal. 
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